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Abstract 

We give explicit polynomial-sized (in n and k) semidefinite representations of the hyperbolic- 
ity cones associated with the elementary symmetric polynomials of degree k in n variables. These 
convex cones form a family of non-polyhedral outer approximations of the non-negative orthant 
that preserve low-dimensional facial structure while successively discarding high-dimensional 
facial structure. 

More generally we construct explicit semidefinite representations (polynomial sized in k, m, 
and n) of the hyperbolicity cones associated with /cth directional derivatives of polynomials of 
the form p(x) = det(^™ =1 AiXi) where the Ai are m x m symmetric matrices. These convex 
cones form an analogous family of outer approximations to any spectrahedral cone. 

Our representations allow us to use semidefinite programming to efficiently solve the linear 
cone programs associated with these convex cones as well as their (less well understood) dual 
cones. 

1 Introduction 

Expressing convex optimization problems in conic form, as the minimization of a linear functional 
over an affine slice of a convex cone, has been an important method in the development of modern 
convex optimization theory. This abstraction is useful (at least from a theoretical viewpoint) 
because all that is difficult and interesting about the problem is packaged into the cone. The 
conic viewpoint provides a natural way to organize classes of convex optimization problems into 
hierarchies based on whether the cones associated with one class can be expressed in terms the 
cones associated with another class. For example, semidefinite programming generalizes linear 
programming because the non-negative orthant is the restriction to the diagonal of the positive 
semidefinite cone. 

When faced with a convex cone the geometry of which is not well understood, we stand to 
gain theoretical insight as well as off-the-shelf optimization algorithms by representing it in terms 
of a cone with known geometric and algebraic structure such as the positive semidefinite cone. 
Terminology is attached to this idea, with a cone being spectrahedral if it is a linear section of 
the positive semidefinite cone, and semidefinitely representable if it is a linear projection of a 
spectrahedral cone. The efficiency of a semidefinite representation is also clearly important. If we 
can write a cone as the projection of a slice of the cone ofmxm positive semidefinite matrices, 
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we say it has a semidefinite representation of size m. Many convex cones have been shown to be 
semidefinitely representable using a variety of techniques (see [4] and [13] for contrasting methods 
and examples). 

The classes of semidefinitely representable cones and spectrahedral cones are distinct, although 
semidefinitely representable cones are perhaps more natural from the point of view of optimization. 
Unlike spectrahedral cones, the class of semidefinitely representable cones is closed under duality. 
Furthermore, a semidefinite representation of a cone suffices to express the associated cone program 
as a semidefinite program. 

The hyperbolicity cones form a large family of convex cones (constructed from certain multi- 
variate polynomials) that includes the positive semidefinite cone, as well as all homogeneous cones. 
While it has been shown (by Lewis et al. [11] based on work of Helton and Vinnikov [8]) that all 
three-dimensional hyperbolicity cones are spectrahedral, little is known about semidefinite repre- 
sentations of higher dimensional hyperbolicity cones. Furthermore while hyperbolicity cones have 
very simple descriptions, their dual cones are not well understood. 

In this paper we give explicit, polynomial-sized semidefinite representations of the hyperbolicity 
cones associated with the elementary symmetric polynomials. These cones form a family of non- 
polyhedral outer approximations to the non-negative orthant known as derivative relaxations. Our 
semidefinite representations naturally generalize to analogous families of derivative relaxations of 
spectrahedral cones. 

1.1 Hyperbolic polynomials and hyperbolicity cones 

A homogeneous polynomial p of degree m in n variables is hyperbolic with respect to e G M. n if 
p(e) ^ and if for all x G W 1 the univariate polynomial t \— > p{x — te) has only real roots. These 
m real roots, denoted X e {x) G M m , are called the characteristic roots of x (with respect to e). 
Garding's foundational work on hyperbolic polynomials [6] establishes that if p is hyperbolic with 
respect to e then the connected component of {x G M n : p(x) ^ 0} containing e is an open convex 
cone. This cone is called the hyperbolicity cone corresponding to (p, e). We denote it by A ++ (p, e), 
and its closure by A + (p, e) . 

Since p is hyperbolic with respect to e if and only if — p is hyperbolic with respect to e, we 
assume, throughout, that p(e) > 0. 

Basic examples: 

• The polynomial p(xi, X2, ■ ■ ■ , x n ) = x\Xi- ■ ■ x n is hyperbolic with respect to e = l n := 
(1,1,..., 1). The vector of characteristic roots is A e (x) = x. The associated closed hy- 
perbolicity cone is the non- negative orthant. 

• Let Ai, A2, ■ ■ ■ ,A n be symmetric m x m matrices and e G M n such that Ya=1 e i^i is posi- 
tive definite. The closed hyperbolicity cone associated with the polynomial p(xi, . . . ,x n ) = 
det(^" =1 XiAi) and the direction e is the spectrahedral cone A + (p, e) = {x G M n : Ya=i x i^i ^ 
0}. 

Roots vs coefficients: Throughout this paper we often pass back and forth between root-based 
and coefficient-based views of hyperbolic polynomials and their hyperbolicity cones. Indeed if p 
has degree m and is hyperbolic with respect to e G W 1 then the following two equivalent conditions 
characterize x G A + (p,e): 
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• the roots of 1 i— > p(x — te) are all non-negative, i.e. 

A+(p, e) = {x G M n : A e (x) G M™} (1) 

• the coefficients of t h-» p(x + te) = p(e)t m + a>i{x)t m ~ % are all non-negative, i.e. 

A+(p,e) = {x G M n : ai(x) > 0, a 2 (x) > 0, a m (x) > 0}. (2) 

This equivalence is the specialization of Descartes' classical 'rule of signs' to the case of polynomials 
with real roots [15, Theorem 20]. Note that the coefficients at(x) appearing in (2) are themselves 
homogeneous polynomials of degree i in x. 

1.2 Derivative relaxations 

If p is hyperbolic with respect to e then (essentially by Rolle's theorem) the directional derivative 
of p in the direction e, viz. 

P { e\ x ) ■= ^;P{x + te) 

is also hyperbolic with respect to e, a construction that goes back to Garding [6]. If p has de- 
gree m, by repeatedly differentiating in the direction e we construct a sequence of polynomials 
P-iPe iPe 2 > • • • ,Pe m ^ each hyperbolic with respect to e. In terms of coefficients, if p(x + te) = 
p(e)t m + YaLi CLi(x)t rn ~ l then the A;th derivative essentially removes high degree terms, i.e. 

pf\x + te) = c a m - k (x) + cia m _ fc _i(x)t H h c m ^ k p(e)t m ~ k 

where c% = (k + i)\/i\ > 0. Hence the corresponding hyperbolicity cone is 

A + (p { e k \e) = {x G M m : a x {x) > 0, o 2 (x) > 0, a m „ fc (x) > 0} 

and can be obtained from (2), the coefficient-based description of A+(p,e), by removing k of the 
inequality constraints. As a result, the hyperbolicity cones A + (p^\ e), which we abbreviate by 
A^\p, e), provide a sequence of outer approximations to the original hyperbolicity cone that satisfy 

A + (p,e) C A^(p,e) C ••• C A^foe). 

The last of these, A^™ ^(p, e), is simply the closed half-space defined by e. The work of Renegar 
[15] highlights the many nice properties of this sequence of approximations. 

Note that we abuse terminology by referring to the cones A^\p, e) as derivative relaxations of 

the hyperbolicity cone A + (p, e). The abuse is that A+\p, e) does not depend only on the geometric 
object A+(p, e) but on its particular algebraic description via p and e. 

1.3 Derivative relaxations of the non-negative orthant 

In the case of p{x) = x\x<i • ■ ■ x n and e = l n , we have that 

p(x + te) = a n (x) + cr n _i(x)t + • • • + cii(x)t n_1 + t n 
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where <7&(a?) = Xa<ii<-<j < n x h ' ' ' x ik denotes the feth elementary symmetric polynomial in n 
variables. As such the kth derivative relaxation of the non-negative orthant, for which we reserve 
the special notation M™'^, has the following two equivalent descriptions: 

= {j, 6 |i . all the roots of t h-> ~£<j™(x — te) are non-negative} (3) 
R™'( fc ) = { x e R n : o-i(x) > 0, <7 2 (aj) > 0, a n - k {x) > 0}. (4) 

Consistent with these descriptions we define R™'^ := W 1 . 

We emphasise the cones IR™'^ because the derivative relaxations of any hyperbolicity cone can 
be expressed in terms of the derivative relaxations of the orthant via the following (an immediate 
consequence of [15, Proposition 18]). 

Proposition 1. If p has degree m and is hyperbolic with respect to e G R n then 

Af J (p,e) = {x G R n : A e (x) G M+' (fc) } . (5) 

Thus, having a semidefinite representation of R?'^ 8 ' together with a semidefinite characteriza- 
tion (in some appropriate sense) of X e (x), allows us to construct a semidefinite representation for 
A<%e). 

1.4 Results 

In this paper we construct two different explicit semidefinite representations of the derivative re- 
laxations of the non-negative orthant. Our root-based representation (see Section 3.1) is based on 
the characterization of M™'^ in (3). Our coefficient-based representation (see Section 3.2) is based 
on the characterization of R?'^ in (4). Our constructions establish the following result. 

Theorem 1. The derivative relaxations of the non-negative orthant have semidefinite representa- 
tions of polynomial size, i.e. for each n G N and 1 < k < n, the cone M™'^ is a projection of a slice 
of the cone ofm(n, k) x m(n, k) positive semidefinite matrices where m(n, k) £ 0(n 2 min{/c, n — k}). 

n (k) 

While both of our representations of M + are polynomial sized in n and k, the root-based 
representation is smaller when k is small while the coefficient-based representation is smaller when 
n — k is small. (Hence the dependence on mm{k, n — k} in the statement of Theorem 1.) 

Using Proposition 1 it is fairly straightforward to generalize these results to construct semidef- 
inite representations (of size polynomial in k,m, and n) of the cones A+\p,e) where p(x) = 
det {Y^i=i AiXi), the A\, . . . ,A n are symmetric m x m matrices, and e is a direction of hyperbol- 
icity for p. We discuss this in more detail in Section 4, and present an example illustrating a 
construction of this type. 

1.5 Related work 

Previous work has also focused on semidefinite representations of the derivative relaxations of the 
orthant. Zinchenko [20] used a decomposition approach to give semidefinite representations of 
M™'^ and its dual cone. Sanyal [17] subsequently gave spectrahedral representations of M™'^ and 

IR + and conjectured that all of the derivative relaxations of the orthant admit spectrahedral 
representations. Recently Branden [2] settled this conjecture, giving representations of each of 
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1 as the intersection of a subspace with the cone of m x m 

n—k 



the cones IR n >( fc ) for k = 1, 2, . . . , n 

positive semidefinite matrices for m ~ n n ~ k . While this is of considerable theoretical interest, these 
representations (unlike ours) are not practical for optimization due to their prohibitive size. 

The spectrahedral representation of R™'^ given by Sanyal was already implicitly given in the 
work of Choe et al. [3] that studies the relationships between matroids and hyperbolic polynomials. 
Choe et al. observe that if Ai is a regular matroid represented by the rows of a totally unimodular 
matrix V then &et(V T diag(x)V) is the basis generating polynomial of A4. In particular, the 
uniform matroid U^ 1 is regular and has a n -\{x) as its basis generating polynomial, yielding a 

symmetric determinantal representation of er n _i(x) and hence a spectrahedral representation of 

jjn,(n-l) 

From a computational perspective, Giiler [7] showed that if p has degree m and is hyperbolic 
with respect to e then log p is a self-concordant barrier function (with barrier parameter m) for the 
hyperbolicity cone A + (p,e). As such, as long as p and its gradient and Hessian can be computed 
efficiently, one can use interior point methods to minimize a linear functional over an affine slice 
of A_|_(p, e) efficiently. Renegar [15, Section 9] gave an efficient interpolation-based method for 
computing p{ k ^ (and its gradient and Hessian) whenever p (and its gradient and Hessian) can be 
evaluated efficiently. Giiler and Renegar's observations together yield efficient computational meth- 
ods to optimize a linear functional over an affine slice of a derivative relaxation of a spectrahedral 
cone. Our results complement these, giving an efficient method to solve optimization problems of 
this type using existing numerical procedures for semidefinite programming. 



1.6 A semidefinite representation of 



p5,(2) 



As a concrete example of our representations, we give the coefficient-based representation of . 
the first case not covered by the previous work on this topic (excluding Branden's). 



,5,(2). 



Example 1. For each n let Q n = [l n /^/n V^] denote an n x n orthogonal matrix with first 
column (1,1,..., 1) I \fn. Then 



x £ 



,5,(5-3) 



such that 



3y«eR|, ^^f'el, M« Z<£\ zf G S% 
3y (2) G Rf, 4 2) G M > m(2) > z ? ] e S l 



Q 5 diag(x)Q 5 h 





AfW 
1 



tr(MW) = 



(i) 



i=l 

M« h y^I 
for i = 2, 3: 



Qldiag(y«)Q 4 ^ 





Af( 2 ) 



tr(M( 2 )) = J2 



(2) 



i=i 

for i = 2: 

M^^zPl-zP 



and 
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(1) 



tr(^ 1} ) > £ 



(i) 



(2) 



tr(^ (2) )>Ef. 



(2) 



(2) . (2) . (2) . „ 
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The representation as presented here looks quite complicated only because, unlike elsewhere 
in the paper, we have chosen to write it out rather explicitly, ignoring simplifications that can be 
made (see Section 5) and abstractions (see Section 2) that allow for more compact expression of 
the same representation. 



1.7 Notation 

Here we define notation not explicitly defined elsewhere in the paper. 



Sets: Let S n and <S" denote the sets of n x n symmetric and symmetric positive semidefinite 
matrices respectively. We write X y Y \i X — Y G 5? for some n. We denote the cone of 
vectors in non-decreasing order by = {x G R n : x\ < X2 < • • • < x n }, and write x^ for the 
ith. smallest entry of x G M n . If C is a convex cone, we denote by C* the dual cone, i.e. the 
set of linear functionals that are non-negative on C. We represent linear functionals on W 1 using 
the standard Euclidean inner product, and linear functionals on S n using the trace inner product 
(X,Y) = ti(XY). Finally, the closure of a set S is denoted cl(S'). 



Maps: If X G S n let \{X) G denote the sorted eigenvalues of X. If x G 1" then diag(x) G S n 
is the symmetric matrix with diagonal entries given by x. If X G S n then diag(X) G R n is the 
vector of diagonal elements of X. The usage will be clear from the context. 



2 Background on spectral constraints and semidefinite represen- 
tations 

2.1 Spectral constraints 

Our semidefinite representations are built upon semidefinite representations for convex cones (and 
their dual cones) of the form 

A -1 [C] := {X £S n : X(X) G C} 

where C C M n is a closed semidefinitely representable cone that is invariant under permutations of 
the coordinates (here we call this a permutation invariant convex cone) . We also require semidefinite 
representations of convex cones of the form 



KciC] :=cl 



t 


Tl 

u 


u 


X 


t 


Tl 

U 


u 


X 



G S n+1 : t > 0, X - uu T /t G \~\C][ 

G S n+l :t>0, X G A _1 [C], tX - uu T G A^fC] 



where C 3 M™ is a closed semidefinitely representable permutation invariant cone containing the 
non-negative orthant. Observe that X — uu T /t is the Schur complement of t in the partitioned 

t vFA 

matrix and that the cone A~ [C] is the Siegel cone associated with A _1 [C] and the bilinear 

form B(u,v) = (uv T + vu T )/2. (For a discussion of Siegel cones and their relationship with 
homogeneous cones see [1], for example.) 
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2.2 Majorization 

If x,y G M n we write x < y if s is majorized by y, i.e. 

n n k k 

5Z x » = ^ y W and f orfc = l,2,...,n-l 

i=l i=l i=l i=l 

where xm < X( 2 ) < ■ • • < ^(n) denotes the entries of x sorted in ascending order. Geometrically 
x <! y if and only if a; is in the convex hull of the vectors obtained by permuting the entries of y. 
It is well known that A -1 [C] can we rewritten as 

A-^C] = {X G S n : By G C such that X(X) < y}. (6) 

To see this, on the one hand note that if X(X) G C we can take y = X(X). On the other hand if 
y G C then the convex hull of the permutations of the entries of y is also in C (by permutation 
invariance and convexity of C) and so if \(X) < y it follows that A(X) G C. 

2.3 Explicit semidefinite representations of A _1 [C] and A" 1 [(7] 

A semidefinite representation of sets of the form A _1 [C] is given in [13, Section 4.3.1]. We now 
present a slight variation that is more suited to our purposes. 

Proposition 2. If C C W 1 is a permutation invariant convex cone then X G A _1 [C] if and only if 

there exists y G C D M™, Z2, ■ ■ ■ , z n -i G K, Z2, . . . , Z„_i ^ 
such that tr(X) = ^21=1 Vh ^ — fl-^j CO 

i 

/or z = 2, . . . , n — 1, X > Z{I — Z{ and iz% — tr(Zj) > yy. (8) 

i=i 

Proof. Suppose such y, Zi and Z% exist (for i = 2, 3, . . . , n — 1). In light of (6) we need to show 
that X{X) < y. From (7) we have that Y^=i ^-(j) (X) = = YTj=iV(j) an d \i)P0 > 2/(1)- 

Furthermore, since X ^ Zjl — and Zi >z. 0, the sum of the smallest i eigenvalues (for i = 
2, 3, . . . , n — 1) of X satisfies 

i i i i 

3=1 3=1 3=1 3=1 

where the last inequality is from (8). Hence X(X) < y and so X G A _1 [C]. 

For the other direction if X G A _1 [C] it is straightforward to check that we can satisfy (7) and 
(8) by choosing y = X(X), Z{ = yi for i = 2, . . . , n — 1, and 

i-l 

Z i = ^[A {i) (X)-A a) (X)K4 
i=i 

where is an eigenvector of X corresponding to X/j\(X) and the Uj are orthonormal. □ 

When C is a permutation invariant convex cone containing the non-negative orthant we can 
give an explicit semidefinite representation of A^-fC] using our representation for A _1 [C] together 
with the following observation. 
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Proposition 3. If C C M. n is a permutation invariant convex cone such that C D WL then 



Kcic] 



t u 1 
u X 



G S n+1 : 3M G X~ l [C] s.t. 



t u 1 
u X 




M 



(9) 



Proof. First we show that Ag^fC] is contained in the right hand side of (9). Suppose [* u x ] £ 
\sc[C]. If t > we can take M = X — uu T /t G A _1 [C]. Since X — M — uu T /t = it follows by 
basic properties of the Schur complement that [ * x-M \ — ^- If t = and [ * u £ ] = [ u £ ] G A" 1 [C] 
it follows that u = and it suffices to take M = X G A _1 [C]. 

For the reverse inclusion suppose t > and there exists M G A _1 [C] such that [* x W -m] — ^- 
It follows (again from properties of the Schur complement) that X — uu T /t >z M. Since C D R+ 
implies A _1 [C] D 5™ we can conclude that X — uu T jt G A _1 [C] as required. Finally if t = and 
there exists M G A _1 [C] such that [^ X ^ T A/ ] = [°x5w] - then n = and X - M - Hence 



tX - uu T = G A -1 [C] and X G A as required. 



□ 



2.4 Duality and spectral constraints 

In order to construct semidefinite representations of the dual cones (H? )* we need the following 
duality relation for sets of the form A _1 [C] where C C W 1 is a permutation invariant convex cone. 
This is a special case of Lewis' result on the convex conjugates of convex spectral functions [10, 
Theorem 2.3]. 

Proposition 4. If C C W 1 is a permutation invariant convex cone then 

(A _1 [C])* = X-^C*} = {X G S n : X(X) G C*}. 
The analogous result for A^fC] is as follows. 
Proposition 5. If C C W 1 is a permutation invariant convex cone such that C D then 

,,T1 



(10) 



Proof. First we show that (A^fC])* is contained in the right hand side of (10). Since C D 
it follows that \£[C\ D and so that (A^fC])* C (S++ 1 )* = Now suppose that 

Ivy] g (^[C])*- Th en for any X G A _1 [C] we have that [g £] G A^fC] and so that 



(Y,X) 



r T" 

s V 




"0 


0" 


v Y 







X 



> 0. 



Hence Y G (A^C])* = A" 1 ^] by Proposition 4. 

For the reverse inclusion assume that [ s V Y ] ^ and Y G A -1 [C*]. Let [ * u x ] be an arbitrary 
element of A~ X [C]. By Proposition 3 there is M G A _1 [C] such that [ * x u f M ] h 0. Then 



+ {Y, M) > 



□ 



r T" 

s V 








r t~ 

S V 






u Y 


5 


it X 


X 


v Y 


5 


u X-M 



and so [«f ] G(A- 1 [C])*. 
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3 Semidefinite representations of 



)71,(/C) 



In this section we present our two semidefinite representations for the cones ifM^ for k = 1, 2, ... , n— 

1 corresponding to the root-based and coefficient-based characterizations of ifM^ in (3) and (4) 
respectively. 

Our two representations are recursive in nature. The root-based representation expresses K?'^ 

(k 1) n—k (0) 

in terms of IR + , working towards the base case M + , i.e. the non-negative orthant. The 

coefficient-based representation expresses R?'^ n k ^ in terms of 1 '^ n k \ working towards the base 

Tnjn— fc+l,(n— k) , 1r 

case K + , a naif-space. 

3.1 Root-based representation 

If x G R n define 

rj(x) := roots of t h-> ^o~ n (x — tl n ). 
Note that t](x) G W 1 " 1 whenever x G IR n . By the definition of rj(x), we obtain the factorization 



f t a n {x - tl n ) = (rji( x ) ~ OfeO) -*)••■ (7?„-i(a?) - t) 

= (T n -l(jl(x) - tl n -l). (11) 

Note that on the left, the elementary symmetric polynomial is in n variables, whereas on the right 
the elementary symmetric polynomial is in n — 1 variables. This immediately yields a recursive 
description of M™'^. 

Lemma 1. Ifl<k<n-l then M™' (fc) = {x G R n : rj(x) G R"" 1,(fc-1) }. 
Proof. 

x G M™'^ all roots of t \-t -rr. a n(x — tl n ) are non-negative 

at K 

d k ~ l 

-<=^ all roots of 1 1 y k _ 1 a n -i(rj(x) — il n -i) are non-negative 

where the equivalence marked with an asterisk follows from (11). □ 

Next we show that ij(x) can be written as the eigenvalues of a symmetric matrix with entries that 
are linear in x. Our argument is essentially a restatement of Sanyal's spectrahedral representation 
of Let V n denote an n x (n — 1) matrix whose columns are an orthonormal basis for the 

subspace {x G R n : ££=i x { = 0}. 

Proposition 6. // x G M n i/ieri 

^CT n (x - tl n ) = cr n „i(x - tl n ) = ndet(F n T diag(x - tl n )V n ). 

Hence rj(x) = A(V^ diag(x)V^) and 

P«.( fc ) _ J~ € R n . yJ diag(a . )K G A- 1 ^- 1 '^ -15 ]} . (12) 
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Proof. The first assertion is a slight variation on Theorem 1.1 of [17] and follows from the Cauchy- 
Binet formula. To see that rj(x) = X(V^ diag(x)V^) simply note that V^diag(cc — tl n )V n = 
di&g(x)V n — tl because V^diag(l n )V^ = V^V n = I. The recursive representation of M™'^ 
then follows directly from Lemma 1. □ 

Expanding the recursive description in (12) gives our root-based representation of R?' . The 
representation as stated in Theorem 2 can be turned into an explicit semidefinite representation by 
appealing to Proposition 2. 

Theorem 2. Suppose 1 < k < n — 1. Then x S R!J. if and only if there exist y\ G M n_1 ; 
y 2 e M.n-2 j _ ^ yk e R n-k guch that 

A(y n T diag(x)K)<yi 
diag( yi )K_i) < y 2 

A(V^ fe+1 diag(y fc -i)K-fc+i) < 

2/fc >0. 

This representation is symmetric in the Xj and polynomial-sized. Indeed each eigenvalue ma- 
jorization condition requires 0(n) additional matrix variables each of size 0(n) (by Proposition 2) 
and there are k such majorization conditions for a total size of 0(n 2 k). 



3.2 Coefficient-based representation 

A drawback of the root-based representation introduced in Section 3.1 is that it gives very compli- 
cated descriptions of R™'^ 1 k ' when k is small. Motivated by this, we now present our coefficient- 
based representation which gives simpler descriptions of R™'^ 1 k ^ when k is small. 

In this section we initially work with the open cones M"'^™ ^ and consider the closure later, 
because the open cones admit the representation (cf. (4)) 

R „,(n-fc) = {x£Rn . ( ai / a ^ x ) > o, (a 2 /a 1 )(x) > 0, . . . , {a k /a h -x){x) > 0} . 

This description is particularly nice because the functions ak+i/ak are concave on M™'^ n k ' [16, 
Theorem 7], and interact well with the operation of taking the polar derivative of a polynomial, 
which we describe next. 



Polar derivatives: Given a univariate polynomial p(t) = X]fc=o a n-kt k , of degree n with real 
roots, let p hom (s,i) = Ylk=o a n-kS n ~ k t k denote the corresponding bivariate homogeneous polyno- 
mial. Let 

V^p(t) := —p hom (l,t) = £(n - k)a n ^ k t k 

k=0 

denote the polar derivative of p. As long as a\ ^ 0, 2?P° lar is of degree n — 1. We can also think 
of the polar derivative as the composition of reversing the coefficients of the polynomial (inverting 
its roots), differentiating the result, and then reversing the coefficients again (again inverting the 
roots). As such, if p has real roots, then so does T> poUr p. 



10 



In the case where p(t) = a n (x — tl r 



n-l 



V^p(t) = ^(-l) fc (n - k)a n _ k (x)t k . (13) 

If g\(x) / then define 

rf(x) := roots of the polar derivative of t \-t a n [x — tl n ) 

so that T> polar p factors as 

V^p{t) = (-lr-V^zXt - rf^x)) ••■(*- d(x)). (14) 

If, in addition, x% ^ for all i then by the reverse-differentiate-reverse interpretation of pp° lar ; 

rf(x) = ^(x" 1 ) -1 

where x -1 G M n is the element-wise inverse of x. 

We now show how quotients of elementary symmetric polynomials and roots of the polar deriva- 
tive interact. In Lemma 2 below, note that the elementary symmetric polynomials on the left are 
in n variables and those on the right are in n — 1 variables. 

Lemma 2. If 1 < k <n — 1 and x G W 1 is such that <J\{x) ^ then 

k 

(a k+ i/a k )(x) = ^-py • (a k /a k -i)(rf(x)). 

Proof. By comparing (13) and (14) we see that for k = 1, 2, . . . , n — 1 

a k (rf(x)) = (k + l)a fc+ i(x)/cri(x) and 
cr fc _i(r/ p (x)) = cT fc (x)/cri(x). 

Computing the appropriate ratio and rearranging yields the identity. □ 

Lemma 2 provides us with another recursive description of the derivative relaxations of the 
orthant. 

Lemma 3. If 1 < k <n — 1 

R n,(n-k) ={x£R n. ^ > ^ ^ £ R n-l,(n-fc) } _ 

Proof. 

x e ^ (ai/c7o)(x) > 0, ((T 2 /ai)(x) > 0, . . . , (<7jfc/<7fc-l)(x) > 

<J=^ (<Ti/cro)(x) > and 

(<7i/<T O )0f (x)) > 0, . . . , {a k - X /a k - 2 ){rf{x)) > 
a x (x) > and r, p {x) G E+~ 1,(fc_1) . 

where the equivalence marked with an asterisk follows from Lemma 2. □ 
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We now show that if (T\{x) > then rf(x) can be expressed in terms of the eigenvalues of a Schur 



complement of a symmetric matrix with entries that are linear in x, allowing us to write 



D n,(n— k) 



in terms of A 



Here, as earlier, Q n is the orthogonal matrix Q n = [l n / \/n V n 



Proposition 7. If 1 < k < n — 1 



inn> n-1 '( n-fc )i 



x G R n : Qi diag(x)Q n G A~ [ 



Proof. By the definition of A sc [C] , the statement is equivalent to establishing that 



pn,(n— fc) 



equals 



cl< x G R n : a x (x) > 0, A V? diag(x)V n 



oi(x) 



D n— l,(n— fc) 



Assume, in addition, that a n (x) ^ so that rf\x) = r/(x ) . By Proposition 6 we have that 



rf{x) = A [(VfdiagOzT^Kr 1 ] = A ( F n T diag(x)K 



Vrt xx 1 V n 
(Ji(x) 



(15) 



The second equality holds because (Fjdiag^- 1 )^)" 1 is the inverse of the lower right submatrix 
in the partitioned matrix 

a\(x)jn x T V n j^fn 1 _ T , _u n 
Vn xj^Jn V* diag(x)K n J 

oi(x~ l )/n (x~ l ) T V n / \/n 

Vn(x^)/Vn V% diag^- 1 )^ 



and so by a property of the Schur complement [19, Theorem 1.2] can be written (after canceling a 
common factor of n) as 

(Vfdiag^)- 1 ^)- 1 = F n T diag(x)K - V^xx T V n /ai(x). 

The identity (15) extends to all x such that a±(x) > (removing the assumption a n (x) ^ 0) since 
rf{x) and the eigenvalues of the Schur complement are continuous for all such x. The result then 
follows by taking the closure in the characterization of k ^ in Lemma 3. □ 

By recursively applying Propositions 3 and 7 we could explicitly write down our coefficient- 



based representation of 



pn,(n— k) 



. Its size would be 0(n 2 (n — k)). Rather than do this we note that 



Example 1 in Section 1.6 gives a concrete illustration of this process. 



3.3 Dual cones 

If a cone is semidefinitely representable so is its dual cone and there are explicit procedures to 
take a semidefinite representation for one and produce a semidefinite representation for the other 
[13, Section 4.1.1]. In this section we note that by using the results of Section 2.4 we can give 

n (k) 

representations of the dual cones of M + that inherit the recursive structure of our original root- 
based and coefficient-based representations. 

Recall from Proposition 6 that = {x G M n : A n {x) G A^pR™ -1 '^^]} where A n (x) = 

Vj[ diag(x)V^ is an injective linear map from W 1 to 5 n_1 . In this case it follows from Lewis' result 
that 

(R r ^)* = {A* n (M) : M G A- 1 [(]R+" l!(fe " 1) )*]} 

= {diag(KMV; T ) : M G A" 1 [(R^ 1 '^^)*]}- 
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Because (M™ k '(°)y _ jg>« k we can apply this result recursively to obtain an analogue of Theorem 2 
for the cones )*. 

Theorem 3. Suppose 1 < k < n— 1. Then x G (M™'^)* if and only if there exist Mi G 5 ra_1 ,M2 6 
5 n ~ 2 , ...,M k e S n ~ k such that 

x = diag(F„M 1 K T ) 
A(Mi) < diag(K_iM 2 y„ T _ 1 ) 

A(M fc _!) < diag(K_ fc+ iAf fc V; T fc+1 ) 
M fc ^ 

To similarly dualize the coefficient based representation, recall from Proposition 7 that M™'^ n ^ = 

{leR": B n (x) G A-^" 1 *^"* 5 ]} where B„(x) = Q^diag(x)Q n is an injective linear map from 
R n to S n . Hence 

(R^))* = {£*(M) : M G (A- 1 ^ 1 '^])*} 

= {diag(Q n MQ^) : M G (Aj c 1 [M!p 1 ' ( - n ~^])*} 

which can be expanded into an explicit semidefinite representation using Proposition 5. 

4 Derivative relaxations of spectrahedral cones 

We now show how to use our polynomial-sized semidefinite representations of the cones MJ?'^ 
to construct polynomial-sized semidefinite representations for derivative relaxations of arbitrary 
spectrahedral cones. Recall from Proposition 1 that if p has degree m and is hyperbolic with 
respect to e we can express the derivative cone A^\p,e) in terms of the cone M™'^ via the 
characteristic roots A e (x). If we can represent A e (x) in terms of the eigenvalues of a symmetric 
matrix with entries linear in x, then we can use our semidefinite representation of M™'^ to give a 

semidefinite representation of A^\p, e). This is precisely the situation we are in when A+(p, e) is 
spectrahedral. 

Proposition 8. Let A 1 ,A 2 ,...,A n G S m and e G W 1 be such that Y!i=i A i e i =■ B y 0. Let 
p(x) = det(^" =1 AiXi). Then for 1 < k < m — 1 

AV% e) = {x G R n : 3y G R^ (k) s.t. A(-B -1 / 2 Q^" = i AxjB'^) < y) . 

Proof. Note that A e (x) = A(S~ 1 / 2 (^™ =1 AiXi)B~ 1 / 2 ) and so is given by the eigenvalues of a sym- 
metric matrix linearly parameterized by x. The result follows by combining Proposition 1 and 
(6). ' □ 

By also appealing to Proposition 4 a similar argument could be used to give a semidefinite 
representation for the dual cone (A+(p, e))* in terms of a semidefinite representation for (R™'^)*. 
We omit the details. 

We conclude this section with an example of these constructions. 
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Figure 1: On the left, the inner region is the 3-ellipse consisting of points with sum-of-distances to 
(0, 0), (0, 4), and (3, 0) equal to 8, i.e. the z = 1 slice of the spectrahedral cone defined by (16). The 
outer three regions are the z = 1 slices of the first three derivative relaxations of this spectrahedral 
cone in the direction (0, 0, 1). On the right are the z = 1 slices of the dual cones of the cones shown 
on the left, with dual pairs having the same shading. 



Example 2 (Derivative relaxations of a 3-ellipse). Given foci (0,0), (0,4) and (3,0) in the plane, 
the 3-ellipse consisting of points such that the sum of distances to the foci equals 8 is shown 
in Figure 1. This is one connected component of the real algebraic curve of degree 8 given by 
{(x,y) G M 2 : det E(x,y, 1) = 0} where E is defined in (16) (see Nie et al. [14]). The region 
enclosed by this 3-ellipse is the z = 1 slice of the spectrahedral cone defined by E(x, y,z) ^0 where 



E(x,y,z) 
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y 

llz — 3x 



(16) 



Note that E(0, 0,1) y and so e = (0, 0, 1) is a direction of hyperbolicity for p(x, y, z) = 
det E(x, y, z). The left of Figure 1 shows the z = 1 slice of the cone A + (p,e) and its first three 
derivative relaxations A^\p, e), A^\p, e), and A^\p, e). The right of Figure 1 shows the z = 1 

slice of the cones (A + (p, e))*, (A+\p, e))*, (A±\p, e))*, and (A+\p, e))*. All of these convex bodies 
were plotted by computing 200 points on their respective boundaries by optimizing 200 different 
linear functionals over them. We performed the optimization by modeling our semidefinite repre- 
sentations of these cones in YALMIP [12] which numerically solved the corresponding semidefinite 
program using SDPT3 [18]. 
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5 Concluding remarks 

We conclude with some comments about (the possibility of) simplifying our representations and 
some open questions. 

5.1 Simplifications 

n (k) 

If we can simplify a representation of R + for some k = i, that allows us to simplify the root-based 
representations for k > i and the coefficient-based representations for k < i. For example R™'^™ 2) 
can be succinctly expressed in terms of the second-order cone Q+ +1 = {x G R n+1 : (J^iLi x 1 ) — 
x n+1 } as 

Then we can represent A" 1 ^™'^™ 2) ] in terms of the second order cone as 

A -l[ R n,(n-2)j = { M eS n. ( M) tr(M)) G Qj 2+1 )} 

because tr(M) = £)? =1 h(M) and Ey=i M ij = E"=i Ai(M) 2 . Consequently Propositions 3 and 7 
can be used to give a concise representation of R™'^ n 3) in terms of the second order cone as 



xGM «,(n-3) ^ 3M£5 n_1 such that 



Qn diag(x)Q n y 




M 



and (M,tr(M)) € qJ* 1)2+1 . 



This should be used as a base case instead of R?'^ n X) in the coefficient-based representations. This 
would, for example, considerably simplify Example 1. Similarly M™'^ can be expressed as 

= {xeM. n : V n r diag(x)T/ n h 0} 
which should be used as the base case instead of R" in the root-based representations. 

5.2 Lower bounds on the size of representations 

The explicit constructions given in this paper establish upper bounds on the minimum size of 
semidefinite representations of R?'^. To assess how good our representations are, it is interesting 
to establish corresponding lower bounds on the size of semidefinite representations of M™'^. In 
the case of M™'^ n 1 \ a halfspace, the obvious semidefinite representation of size one is clearly of 

minimum size. Less trivial is the case of R™' (0) , the non- negative orthant. It has been shown by 
Gouveia et al. [5, Section 5] that R™ does not admit a semidefinite representation of size smaller than 
n. Hence the obvious representation of R" as the restriction of <S" to the diagonal is of minimum 
size. We are not aware of any non-trivial lower bounds on the size of semidefinite representations 
of the cones R™'^ for 1 < k < n — 2. 

The semidefinite representations of R™'^ given in this paper are equivariant in that they ap- 
propriately preserve the symmetries of R"'^ fc) . (For a precise definition see [5, Definition 2.10].) It 
is known that symmetry matters when representing convex sets as projections of other convex sets 
[9]. For example if p is a power of a prime, equivariant representations of regular p-gons in R 2 
are necessarily much larger than their minimum-sized non-equivariant counterparts [5, Proposition 
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3.5]. Given that the cones are highly symmetric, it would also be interesting to establish 

lower bounds on the size of equivariant semidefinite representations of the derivative relaxations of 
the non- negative orthant. 
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